PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



| (51) International Patent Classification 5 : 
1 C07K 15/04, C12N 15/62 



Al 



(11) International Publication Number: WO 95/00555 

(43) International Publication Date: 5 January 1995 (05.01.95) 



(21) International Application Number: PCT/EP94/02088 

(22) International Filing Date: 28 June 1994 (28.06.94) 



EP 



(30) Priority Data: 

931 10298.2 28 June 1993 (28.06.93) 

(34) Countries for which the regional or 

international application was filed: DE et al. 



(71) Applicant (for all designated States except US): EUROPEAN 

MOLECULAR BIOLOGY LABORATORY (EMBL) 
[DE/DE]; Meyerbofstrasse 1, D-69117 Heidelberg (DE). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): STEWART, Francis 
[AU/DE]; Pfairgasse 7, D-69251 Gaiberg (DE). 

(74) Agents: WHCKMANN, H. et al.; Koperaikusstrasse 9, D- 
81679 Muncben (DE). 



(81) Designated States: AH AT, AU, BB, BG, BR, BY. CA, CH, 
CN, CZ, DE, DK, ES, FI. GB, GE, HU, JP, KE, KG, KP, 
KR, KZ, LK, LU, LV, MD, MG, MN, MW, NL, NO, NZ, 
PL, FT, RO, RU. SD. SE, SI SK, TJ, TT, UA, US, UZ, 
VN, European patent (AT, BE, CH. DE, DK, ES. FR, GB, 
GR, IE, TT, LU, MC, NL, FT, SE), OAPI patent (BF. BJ, 
CF, CG, a CM GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 

claims and to be republished in the event of the receipt of 

amendments. 

With an indication in relation to a deposited microorganism 
furnished under Rule 75*" separately from the description. 
Date of receipt by the International Bureau: 

23 September 1994 (23.09.94) 



(54) Title: REGULATION OF Srr&SPEOFIC RECOMBINATION BY SITE-SPECIFIC RECOMBINASE/NUCLEAR RECEPTOR 
FUSION PROTEINS 



(57) Abstract 

An invention is described that permits- the regulation of recombination in cells, organisms or appropriate cell-free systems. The 
invention involves creating fusion proteins between recombinase proteins, or components of recombinase systems, and ligand binding 
domains derived from nuclear receptors. The fusion proteins show little recombinase activity in the absence of the ugand that binds to the 
ligand binding domain. Upon binding of the Ugand, recombinase activity is induced The invention provides a practical means to regulate 
recombination in cells and organisms and. by linking Ugand binding to recombination, provides a simple means whereby ugand binding 
can be measured as recombination achieved. 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT oo the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Auitru 


GB 


United Kingdom 


MR 


A (J 


Australia 


GE 


Georgia 


MW 


BB 


Barbados 


GN 


Guinea 


NE 


BB 


Belgium 


GR 


Greece 


NL 


BF 


Burkina Fho 


HU 


Hungary 


NO 


BG 


Bulgaria 


IE 


Ireland 


NZ 


BJ 


Bcoin 


rr 


Italy 


PL 


BR 


Brazil 


JP 


Japan 


PT 


BY 


Belarus 


KE 


Kenya 


RO 


CA 


Canada 


KG 


Kyrgyttao 


RU 


CF 


Central African Republic 


KP 


Democratic People* • Republic 


SD 


CG 


Co 0(0 




of Korea 


SE 


CH 


Switzerland 


KR 


Republic of Korea 


SI 


CI 


C6te d'lvobe 


KZ 


Kazakhstan 


SK 


CM 


Cameroon 


U 


[ ifrtumrtfin 


SN 


CN 


Qxina 


LK 


Sri Lanka 


TD 


CS 


CzechoslftvafcU 


LU 


I^nembourg 


TG 


CZ 


Czecn Republic 


LV 


Latvia 


TJ 


DE 


Gennaay 


MC 


Monaco 


IT 


DK 


Deomark 


MD 


Republic of Moldova 


CA 


ES 


Spaia 


MG 


Madagascar 


US 


Fl 


Roland 


ML 


Mali 


TJZ 


FR 


Prance 


MN 


Mongolia 


VN 


GA 


Gabon 









Mauritania 

Malawi 

Niger 

Netherlands 

Norway 

New Zealand 

Poland 

PoraigaJ 

Romania 



Sudan 

Sweden 

Slovenia 

Slovakia 

Senegal 

Chad 

Togo 

Tajikistan 

Trinidad and Tobago 
Ukraine 

United State* of America 

Uzbekistan 

Viet Nam 



WO 95/00555 PCT/EP94/02088 

1 

Regulation of site-specific recombination by site-specific 
recombinase/nuclear receptor fusion proteins 
Specification 

The present invention relates to the technical field of genetic 
manipulation, and more specifically, to the use of recombina- 
tion-mediated DNA rearrangements and to the use of the regula- 
tory potential of nuclear receptors. 

The use of site-specific recombinases (SSRs) to induce defined 
rearrangements of DNA has been described in a variety of organ- 
isms (1-12) . These reports describe the introduction of a DNA 
construct that contains SSR target sites. Subsequent exposure 
to the SSR enzyme activity resulted in the DNA rearrangement 
determined by the disposition of the target sites (see refe- 
rence 13 for a recent review of SSRs) . Three SSRs have been 
used in this manner to date; FLP recombinase from the 2p epi- 
some of Saccharomyces cerevisiae (1,2,5,6,9,10), CRE recombi- 
nase from the Escherichia coli phage PI (3,4,8,11,12) and R 
recombinase from pSRl of Zygosaccharomyces rouxii (7) . Amongst 
other SSR systems relevant to the invention described here are 
those listed in references 13 and 14, and SSRs from Kluyveromy- 
ces drosophilarium (15), Kluyveromyces waltii (16), X Int (17) 
and the Gin recombination system from phage Mu (18) . The con- 
tent of the above document is incorporated by reference. 

For many applications in cells, organisms and cell-free in 
vitro systems, SSR induced DNA rearrangements must be regula- 
ted. Current implementation of the potential offered by SSRs is 
limited by the means available to regulate SSR activity. In 
experiments with cultured cells, unregulated SSRs have been 
used. For example, after introduction of SSR target sites into 
cells, recombination has been induced by subsequent introduc- 
tion of either FLP recombinase by transfection of DNA (4,9) or 
injection of CRE recombinase protein (12). That is, the inten- 
ded recombination event was regulated merely by the time of 
introduction of an appropriate macromolecule . Amongst other 
limitations, this precludes the creation and proliferation of 
homogeneous populations of cells that contain both the unrear- 
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ranged target sites and the SSR and in which the recombination 
event can be induced after cell numbers have been expanded. 

In experiments with transgenic animals, the issue of SSR regu- 
lation has been addressed by regulating the expression of an 
introduced SSR gene using the inducible heat -shock promoter in 
Drosophila (5) or a tissue-specific promoter in mice (11) . Both 
approaches have limited applicability. Namely, heat-shock regu- 
lation of transgene expression is currently only useful in 
flies and no suitable counterpart is available for use in cell 
lines or vertebrates. Also, the use of a tissue-specific promo- 
ter to regulate transgene expression relies on the limited 
availability of suitable promoters and enhancers and the ex- 
pression pattern achieved is confined to the times and places 
at which these tissue specific elements are active. 

The problem underlying the present invention was to provide a 
regulated recombination system, wherein the disadvantages of 
the prior art are at least partially eliminated. More specifi- 
cally, the problem was to provide a recombination system, 
wherein the recombination event can be induced independently 
from any tissue specific restrictions. 

This patent application describes an invention that regulates 
SSR activity, rather than its expression. Thus any means of 
achieving and directing expression can be used, such as using 
ubiquitous or broadly active promoters and enhancers, as well 
as tissue specific or inducible promoters and enhancers. 

One aspect of the present invention relates to a fusion pro- 
tein, comprising a recombinase protein or a component of a re - 
combinase complex, fused to part or all of a nuclear receptor, 
so that the amino acids that bind the ligand of said nuclear 
receptor are included, such that in cells or appropriate cell- 
free systems: (a) recombinase activity is inhibited in the ab- 
sence of ligand binding to said ligand binding domain and 
(b) recombinase activity is induced or altered by binding of 
ligand to said ligand binding domain. 
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Preferably the recombinase activity is altered by binding of 
ligand to ligand binding domain by a factor of at least 10, 
more preferably of at least 20 and most preferably of at least 
40. 

The present invention resides in the regulation of SSR activity 
by fusing an SSR protein to the ligand binding domain (LBD) of 
a nuclear receptor. SSR-LBD fusion proteins comprise an amino 
acid sequence of SSRs physically attached to the amino acid 
sequence of an LBD of a nuclear receptor. That is, the inven- 
tion is a fusion protein, comprising a recombinase protein, or 
a component of a recombinase complex, fused to the ligand bin- 
ding domain of a nuclear receptor. Preferably, the recombinase 
protein or a component of a recombinase complex is fused to the 
nuclear receptor or ligand binding domain thereof by means of 
genetic fusion, i.e. the SSR-LBD protein is a linear genetic 
fusion encoded by a single nucleic acid. On the other hand, the 
present invention also encompasses SSR-LBD fusion proteins 
which are linked by different means, e.g. through a spacer mo- 
lecule having reactive groups thereon, which are covalently 
bound to each protein domain. 

Most simply, the attachment of the SSR and LBD components can 
be achieved by making a DNA construct that encodes for the ami- 
no acid sequence of the SSR-LBD fusion protein with the LBD 
encoding DNA placed in the same reading frame as the SSR enco- 
ding DNA, preferably either at the amino or carboxy termini of 
the SSR protein (19) . More preferably, the nuclear receptor or 
ligand binding domain thereof is fused to the C-terminus of the 
recombinase protein or component of a recombinase complex. In 
an especially preferred embodiment of the present invention the 
nuclear receptor or ligand binding domain thereof is fused to 
the recombinase protein or component of a recombinase complex 
through a peptide linker which mainly consists of hydrophilic 
acids and preferably has a length of 4 to 20 amino acids. 

SSR-LBD fusion proteins can coexist with target sites without 
recombination occurring since these proteins require ligand 
binding to the LBD for recombinase activity. In our experi- 
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merits, recombinase activity of the SSR-LBD fusion protein, in 
the absence of the relevant ligand, is at least 200x less ac- 
tive than wild type recombinase activity. Upon presenting the 
SSR-LBD fusion proteins with the relevant ligand, recombinase 
activity is induced to more than 20% of wild type, that is, 
equal to or greater than 4 Ox induction. This is the first 
description of post-transcriptional regulation of SSRs by any 
means. In particular, the invention permits the propagation of 
both the unrearranged target sites and the SSR-LBD in living 
systems. More specifically: 

(a) cell lines can be established that carry both the unrear- 
ranged target sites and the SSR-LBD as a prerequisite for bio- 
chemical studies or introduction into organisms. The cell lines 
can be homogeneous and characterised before expansion to the 
required quantities and subsequent induction of recombination 
by administration of the relevant ligand, 

(b) recombination can be regulated in any experimentally mani- 
pulatable multicellular organism by administration of the rele- 
vant ligand. The SSR-LBD could be introduced to the organism 
either through the incorporation of cells or by direct means 
such as microinjection or in the genome of a viral based vec- 
tor. Recombination can be induced after characterisation by 
administration of the relevant ligand. 

The term "nuclear receptor" , as used herein, refers to a mole- 
cule, preferably a protein molecule, which may be glycosylated 
or unglycosylated, having the abilities to bind ligand and to 
be incorporated into a nucleus of a cell. Specifically, the 
term nuclear receptor refers to those proteins that display 
functional or biochemical properties that are similar to the 
functional or biochemical properties displayed by the steroid 
hormone receptors with respect to ligand binding, for example, 
the dioxin receptor (Whitelaw, M.L., G6ttlicher, M . , Gustafs- 
son, J. -A. and Poellinger, L., (1993) EMBO J. 12, 4169-4179). 
More specifically, the term nuclear receptor refers to those 
proteins that are related by their amino acid sequence to the 
LBDs of the steroid hormone receptors. The paper of Laudet et 
al. (1992) (Laudet, V., H&nni, C, Coll, J., Catzeflis, F. and 
Stehelin, D. (1992) EMBO J. 11, 1003-1013) describes an align- 
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ment of nuclear receptor amino acid sequences using standard 
methodologies. Included in the definition of nuclear receptors 
used here are also those proteins not listed in Laudet et al . 
but which can be included using the methods employed by Laudet 
et al.. The term nuclear receptor also includes mutant deriva- 
tives of nuclear receptor amino acid sequences which retain 
sufficient relatedness to nuclear receptor amino acid sequences 
as to be identifiable as related using the methods employed by 
Laudet et al . 

The nuclear receptor which is fused to the recombinase protein 
is preferably a hormone receptor, e.g. a receptor recognized by 
steroids, vitamins or related ligands. Examples of suitable 
nuclear receptors are listed in reference 20, which is hereby 
incorporated by reference. Preferably, the nuclear receptor is 
a steroid hormone receptor, more preferably, a glucocorticoid, 
estrogen, progesteron, or androgen receptor. 

In the SSR-LBD fusion protein of the present invention, it is 
not required that the complete nuclear receptor is present; 
i.e. it is sufficient that the amino acids that bind the ligand 
are fused to the SSR. 

Upon binding their relevant ligand, nuclear receptors become 
active, or altered, transcription factors. The cloning of cDNAs 
encoding members of the steroid receptor family of proteins 
revealed that they share amino acid sequence homology, in 
particular in the protein domain that binds ligand. The LBD can 
be separated from the rest of the protein and fused to other 
transcription factors conferring ligand regulation onto the 
resulting fusion proteins. For the glucocorticoid and estrogen 
receptors, the domain that binds ligand has been fused to other 
transcription factors and also to oncoproteins, rendering the 
fusion proteins dependent on the relevant ligand for activity 
(19,21-26) . All characterised oncoproteins that have been 
experimentally regulated in this manner so far are also trans- 
cription factors. Transcription factors and oncoproteins are 
not enzymes in the classical sense and the regulation of their 
activities by steroids and LBDs described in references 19, 21- 
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26, is essentially similar to the role that LBDs play in regu- 
lating steroid receptors. Prior attempts to extend the regula- 
tory possibilities of the LBD fusion protein strategy beyond 
transcription factors and oncoproteins have yet to be reported. 
Attempts to regulate the enzyme activities of either fi-galacto- 
sidase or dihydrofolate reductase by fusing them with the 
glucocorticoid LBD have been unsuccessful (D. Picard, public 
seminar given at EMBL, Heidelberg, Nov. 1991) . 

The present invention extends the regulatory possibilities of 
the LBD fusion protein strategy beyond the currently documented 
transference of ligand regulation from nuclear receptors to 
other transcription factors, to include an enzyme activity, 
namely a site specific recombinase. The term "site specific 
recombinase" refers to any protein component of any recombinant 
system that mediates DNA rearrangements in a specific DNA 
locus, including site specific recombinases of the integrase or 
resolvase/invertase classes (13; Abremski, K.E. and Hoess, R.H. 
(1992) Protein Engineering 5, 87-91; Khan, E . , Mack, J.P.G., 
Katz, R.A., Kulkosky, J. and Skalka, A.M. (1991) Nucleic acids 
Res. 19, 851-860) and site-specific recombination mediated by 
intron-encoded endonucleases (Perrin, A., Buckle, M. and Dujon, 
B. (1993) EMBO J. 12, 2939-2947). Preferred recombinase pro- 
teins, which can be used in the fusion proteins according to 
the invention, are selected from the group consisting of: FLP 
recombinase, Cre recombinase, R recombinase from the Zygo- 
saccharomyces rouxii plasmid pSRl, A recombinase from the Kluy- 
veromyces drosophilarium plasmid pKDl, A recombinase from the 
Kluyveromyces waltii plasmid pKWl, any component of the X Int 
recombination system, any component of the Gin recombination 
system, or variants thereof. The term "variant" in this context 
refers to proteins which are derived from the above proteins by 
deletion, substitution and/or addition of amino acids and which 
retain their some or all of the function inherent in the pro- 
tein from which they are derived. Specifically, the variant 
could retain the ability to act as a recombinase, or it could 
retain protein/protein or protein/DNA interactions critical to 
the recombination reaction, or to the regulation of the recom- 
bination reaction. 
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In a preferred embodiment of the invention, FLP recombinase is 
fused to the LBD of the estrogen, glucocorticoid, progesterone 
or androgen receptors (20) . Other preferred embodiments include 
fusing Cre recombinase or R recombinase, or SSRs from Kluyver- 
omyces drosophilarium or Kluyveromyces waltii, to these LBDs. 
Another preferred embodiment involves regulating one or more 
components of an SSR complex to these LBDs, in particular, com- 
ponents of the X Int or Gin recombination systems. The inven- 
tion, in providing a means to regulate recombination, is how- 
ever not limited to known recombinases and recombination com- 
plexes and is not limited to known nuclear receptor LBDs. Rat- 
her, the strategy of fusing recombinases, or components of re- 
combination complexes, to LBDs of nuclear receptors is appli- 
cable to any fusion combination of these proteins which display 
the desired characteristics readily identifiable without undue 
experimentation on the part of a skilled person. 

A further subject-matter of the present invention is a nucleic 
acid which encodes the fusion protein according to the present 
invention. Preferably, the nucleic acid is a DNA or RNA. 

Still a further subject-matter of the present invention is a 
recombinant vector comprising at least one copy of the nucleic 
acid as defined above. This recombinant vector may be a euka- 
ryotic vector, a viral vector, or a prokaryotic vector, or a 
vector which can be maintained in eukaryotic and prokaryotic 
host cells. The recombinant vector is obtainable by inserting a 
nucleic acid encoding a SSR-LBD fusion protein into a suitable 
starting vector. Specific examples of suitable starting vectors 
are given for example in Molecular Cloning. A Laboratory Manu- 
al, 2nd edition, J. Sambrook et al. (1989), Cold Spring Harbor 
Laboratory Press, chapters 1, 2, 3, 4, 16 and 17. 

The vector of the present invention may be an extra- chromosomal 
vector or a vector which integrates into the genome of the host 
cell. Extra -chromosomal vectors, e.g. circular plasmids are 
especially preferred. An example of a plasmid vector is the 
plasmid pHFEl comprising a nucleic acid encoding a fusion of 
the FLP recombinase and the ligand binding domain of the estro- 
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gen receptor. The plasmid pHFEl has been deposited at the 
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH 
(DSM) , Mascheroder Weg lb, D-38124 Braunschweig on June 25, 
1994 under the accession number DSM 9265 according to the 
requirements of the Budapest Treaty. 

Still a further subject-matter of the present invention is a 
cell containing a nucleic acid or a recombinant vector as defi- 
ned above. Preferably, the cell is stably transformed with the 
nucleic acid or the vector. Suitable cells are eukaryotic or 
prokaryotic cells. Examples of prokaryotic cells are gram-nega- 
tive bacterial cells, especially E.coli cells. Examples of 
eukaryotic cells are mammalian cells, yeasts and trypanosomes . 

The invention separates the function of ligand binding from the 
functions of transcription factors, coupling ligand binding to 
recombinase activity. Thereby, the binding of ligand can be 
assessed by any means that measures recombinase activity. Thus, 
further embodiments of the invention include methods for deter- 
mining the binding of ligand to the LBD of a nuclear receptor, 
comprising the steps of: 

(i) the introduction of the SSR-LBD fusion protein or the nu- 
cleic acid coding therefor into cells, or appropriate cell-free 
systems, that contain the DNA target sites for the SSR; (ii) 
administering the ligand or a mixture suspected to contain a 
ligand or ligands to be evaluated; if the ligand is not already 
present in the system; (iii) detecting the recombinase activi- 
ty, if any, of the SSR-LBD by detecting, directly or indirect- 
ly, recombination or changes in the recombination rate between 
the DNA target sites. 

The introduction of SSR-LBD fusion proteins into cells may be 
accomplished by transforming the cells with appropriate vectors 
containing a nucleic acid coding for the SSR-LBD fusion pro- 
tein. The administration of the ligand to be evaluated is not 
necessary in systems which already contain the ligand and in 
which the ligand-concentration is determined by detecting re- 
combination between DNA target sites. 
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Particularly preferred embodiments include: 

(a) direct measurements of the physical arrangement of the tar- 
get sites using techniques such as gel electrophoresis of DNA 
molecules, Southern blotting or polymerase chain reaction. 

(b) indirect measurements by assessing the properties encoded 
by the DNA regions carrying the target sites before or after 
recombination. For example, recombination could excise an anti- 
biotic resistance gene from the genome of the host and thus 
recombination can be measured as sensitivity to the antibiotic. 

(c) the measurement of intracellular ligand concentrations by 
evaluating recombination between the DNA target sites resident 
in the cells that also carry the SSR-LBD fusion proteins. 

(d) the evaluation of ligand binding to an LBD without the need 
to use radiolabelled ligand derivatives or without reliance on 
the transcription factor properties of the corresponding nu- 
clear receptor. 

(e) evaluating the effect of mutations in the LBD on ligand 
binding. Since SSR-LBD recombination requires ligand, mutations 
in the LBD that decrease ligand binding can be ascertained. 
Alternatively, mutations that improve binding of a different 
ligand can be selected. For example, ligand- dependent recombi- 
nation could rearrange the DNA, so that an antibiotic resis- 
tance gene is expressed. Cells with the rearranged DNA will 
grow under the appropriate antibiotic selection, cells with 
unrearranged DNA will not. Specifically this describes a method 
for determining the effect of mutations in the LBD of a nuclear 
receptor on its ability to bind ligand, comprising the steps of 
(a) introducing mutations into the LBD of the SSR-LBD fusion 
protein, and (b) following steps (i) to (iii) above. 

The invention also encompasses a method for regulating the re- 
combination of DNA target sites, comprising the steps of: 

(a) providing cells or appropriate cell-free systems that con- 
tain DNA target sites for a site-specific recombinase, 

(b) contacting the DNA target sites with a fusion protein ac- 
cording to the present invention, which contacting may be ac- 
complished by direct introduction of the protein or by trans- 
formation of a cell with the nucleic acid encoding the protein 
and expressing the nucleic acid to produce the protein, 
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(c) effecting a recombination of said DNA target sites by con- 
tacting the fusion protein with a ligand for the nuclear recep- 
tor component of the fusion protein, e.g. by adding the ligand 
to the system at a predetermined time, or by effecting produc- 
tion of the ligand in the system at a predetermined time. 

Description of the Figures 

Figure 1A depicts the plasmid, pNEOSGAL (thin line) , integrated 
into the genome of 293 cells (thick line). FLP recombinase tar- 
get sites (FRTs) are depicted as broad arrows and the neomycin 
resistance gene lies between the two FRTs. Upon estradiol admi- 
nistration, the DNA between the FRTs is excised, leaving one 
FRT in the genome and one on the excised circular DNA. The 
probe used for the Southern blot of Fig. 2 is also depicted. 

Figure IB depicts the steps involved in the creation of plasmid 
pHFEl . 

Figure 2 shows a Southern blot of a timecourse of recombination 
occurring in the cell line PI. 4. At the times indicated after 
estradiol, or ethanol only, administration in the upper part of 
the figure, cells were lysed, DNA purified, restricted with 
Ndel, run on a 1.25% agarose gel and blotted to Biodyne B mem- 
branes by standard methodologies. The membrane was hybridized 
with the radioactively labelled probe depicted in Fig. 1A. The 
upper band shows the unrecombined integrated DNA and the lower 
the recombined band. Estradiol was dissolved in ethanol and the 
extreme right hand lane shows cells treated with the equivalent 
amount of ethanol, without estradiol, for 51 hours. 

Figure 3 shows the complete amino acid sequence of the SSR-LBD- 
fusion polypeptide encoded by the plasmid pHFEl. The FLP recom- 
binase domain is located from amino acid 1-423. The linker 
peptide is located from amino acid 424-428. The estrogen bin- 
ding domain is located from amino acid 429-773. 

Figure 4 shows the nucleic acid sequence coding for the SSR-LBD 
fusion polypeptide of Figure 3. The parts coding for the FLP 
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recombinase domain, the linker peptide and the estrogen binding 
domain, respectively, are specified. 

The following examples are offered by way of illustration and 
is not intended to limit the invention in any manner. 

Example 1 

Construction of plasmid containing a SSR-LBD- fusion gene 

The experiment involves two plasmids, one of which is the tar- 
get for recombination, and the other is the recombinase expres- 
sion plasmid. As the target plasmid, pNEOBGAL (9) was employed. 
pNEOfiGAL contains two FLP recombinase target sites (FRT) sur- 
rounding a gene encoding neomycin resistance (Fig. la) . The 
expression plasmid, pHFE 1, carries two genes, one encoding an 
FLP recombinase-estrogen receptor LBD fusion protein and the 
other encoding hygromycin resistance (Fig. lb) . The plasmid was 
constructed by standard cloning procedures using pBluescribe 
(Stratagene) , pOG 44 (9), pHE63 (22) and pCNH2 (27). The stop 
codon of the FLP recombinase coding region present in p0G44 was 
mutated by oligonucleotide replacement to introduce BamHl, 
BsiWl and EcoRl sites and continue the open reading frame. The 
BamHI site in the estrogen receptor encoded by pHE63 was used 
to join the coding regions of FLP recombinase and estrogen 
receptor LBD. All of the estrogen receptor coding region car- 
boxy to the BamHI site is present in the plasmid, including its 
stop codon. 



Example 2 

Regulation of site specific recombination 

pNEOSGAL was introduced into 293 human embryonal kidney cells 
by electroporation (5 x 10 6 cells in 500 ill phosphate-buf f ered 
saline with 5 fig of Apal linearised plasmid; 300 V at 125 fiF in 
a 1 ml cuvette using a Bio-Rad Gene Pulser) and selection for 
neomycin resistance (G418, 0.4 mg/ml) . Single colonies of resi- 
stant cells were cloned and characterised for incorporation of 
target plasmid DNA by Southern blotting. A clone, PI, showing 
integration of a single copy of target plasmid DNA was selected 
and transfected, using the calcium phosphate procedure, with 
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the second plasmid, pHFE 1 (1 |ig of XmnI linearised pHFE 1 was 
precipitated by the standard calcium phosphate method onto 10 7 
PI cells which were cultured in DMEM without phenol red, 10% 
charcoal stripped fetal calf serum) . Four clones resistant to 
both neomycin and hygromycin (0.4 mg/ml G418, 0.4 mg/ml hygro- 
mycin B) were isolated and the dependance of recombination on 
estradiol administration was observed in all four. For clone 
PI. 4, a time course of recombination in the presence of 10* 6 M 
estradiol is shown (Fig. 2) . 
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Claims 

1 . A fusion protein, comprising a recombinase protein or a 
component of a recombinase complex, fused to part or all of 
a nuclear receptor so that the amino acids that bind the 
ligand of said nuclear receptor are included, such that, in 
cells or appropriate cell-free systems: 

(a) recombinase activity is inhibited in the absence of 
ligand binding to said ligand binding domain, and 

(b) recombinase activity is induced or altered by binding of 
ligand to said ligand binding domain. 

2. The fusion protein of claim 1, 

wherein the nuclear receptor is a hormone receptor. 

3. The fusion protein of claim 1 or 2, 

wherein the nuclear receptor is a steroid hormone receptor. 

4. The fusion protein of any one of the claims 1-3, wherein the 
nuclear receptor is a mutated derivative of a nuclear 
receptor such that it retains the characteristics of the 
fusion protein of claim 1. 

5. The fusion protein of any of the claims 1-4, 

wherein the nuclear receptor is a vertebrate glucocorticoid, 
estrogen, progesteron or androgen receptor. 

6. The fusion protein of any one of the claims 1-5, 
wherein the recombinase protein or component of a recombi- 
nase complex is selected from the group consisting of: FLP 
recombinase, Cre recombinase, R recombinase from the Zygo- 
saccharomyces rouxii plasmid pSRl, A recombinase from the 
Kluyveromyces drosophilarium plasmid pKDl, A recombinase 
from the Kluyveromyces waltii plasmid pKWl, any component of 
the X Int recombination system, any component of the Gin 
recombination system, or a variant thereof. 

7. The fusion protein of any of claims 1-6, 
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wherein the recombinase protein or compound of a recombinase 
complex is fused to the nuclear receptor or ligand binding 
domain thereof by means of a genetic fusion. 

8. The fusion protein of any of claims 1-7, 

wherein said nuclear receptor or ligand binding domain the- 
reof is fused to the N- or C-terminus of said recombinase 
protein or component of a recombinase complex. 

9. The fusion protein of any of claims 1-8, 

wherein said nuclear receptor is fused to said recombinase 
protein or component of a recombinase complex through a 
peptide linker. 

10. The fusion protein of any of claims 1-9, 

which comprises the FLP recombinase and the ligand binding 
domain of the estrogen receptor. 

11. A nucleic acid which encodes the fusion protein of any one 
of the claims 1-10. 

12. The nucleic acid of claim 11, which is a DNA or RNA. 

13 . A recombinant vector comprising at least one copy of the 
nucleic acid of claims 11 or 12. 

14. The vector of claim 13, which is a plasmid. 

15. The plasmid pHFEl (DSM 9265) 

16. A cell containing a nucleic acid of claims 11 or 12 or a 
recombinant vector of any one of the claims 13-15. 

17. The cell of claim 16, which is a mammalian cell. 

18 . A method for determining the binding of a ligand to the 
ligand binding domain of a nuclear receptor, comprising the 
steps of : 
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(a) the introduction of the fusion protein of claims 1-9, or 
the nucleic acid of claims 10 or 11 into cells, or appropri- 
ate cell -free systems, that contain the JDNA target sites for 
recombination, 

(b) optionally administering the ligand or a mixture suspec- 
ted to contain a ligand or ligands, to be evaluated, 

(c) detecting the recombinase activity, if any, of said . 
fusion protein by detecting recombination or changes in the 
recombination rate between said DNA target sites. 

19. The method of claim 18, 

wherein the recombination between said DNA target sites is 
detected by direct measurement of the physical arrangement 
of said target sites. 

20. The method of claim 18, 

wherein the recombination between said DNA target sites is 
detected by assessing the properties encoded by the DNA 
regions carrying the target sites before or after recombina- 
tion. 

21. A method for determining the effect of mutations in the 
ligand binding domain of a nuclear receptor on its ability 
to bind ligand, comprising the steps of: 

(a) introducing mutations into the ligand binding domain of 
the fusion protein of claims 1-10, 

(b) following steps (a) to (c) of claim 18. 

22. A method for regulating the recombination of DNA target 
sites, comprising the steps of: 

(a) providing cells or appropriate cell-free systems that 
contain DNA target sites for a site specific recombinase, 

(b) contacting the DNA target sites with a fusion protein 
according to any one of the claims 1-10, and 

(c) effecting recombination of said DNA target site by con- 
tacting the fusion protein with a ligand for the nuclear 
receptor component of said fusion protein. 
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FIG. IB 
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FIG. 2 
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Fig. 3 

FLP/EBD AMINO ACID SEQUENCE AS ENCODED BY PHFE1 
1 MPOJFDILCKT PPKVLVRQFV ERFERPSGEK IALCAAELTY LCWMTTHNGT 
5 1 AIKRATFMSY NTHSNSLSL DIVNKSLQFK YKTQJCATILE ASLKKLIPAW 
101 EFTIIPYYGQ. KHQSDITDIV SSLQLQFESS EEADKGNSHS KKMLKALLSE 
15 1 GESIWEITEK ILNSFEYTSR FTKTKTLYQF LFLATFINCG RFSDIKNVDP 
201 KSFKLVQNKY LGVIIQCLVT ETKTSVSRHI YFFSARGRID PLVYLDEFLR 
25 1 NSEPVLKRVN RTGNSSSNKQ. EYQLLKDNLV RSYNKALKKN APYSIFAIKN 
301 GPKSHIGRHL MTSFLSMKGL TELTNWGNW SDKRASAVAR TTYTHQJTAI 
351 PDHYFALVSR YYAYDPISKE MIALKDETNP IEEWQHIEQL KGSAEGSIRY 
401 PAWNGHSQE VLDYLSSYIN RRI 

- FLP ENDS HERE - 
424 SVRGS 

- LINKER PEPTIDE ENDS HERE - 

429 MK GGIRKDRRGG RMLKHKRQRD 

45 1 DGEGRGEVGS AGDMRAANLW PSPLMIKRSK KNSLALSLTA DQMVSALLDA 
501 EPPILYSEYD PTRPFSEASM MGLLTNLADR ELVHMINWAK RVPGFVDLTL 
55 1 HDQVHLLECA WLEILMIGLV WRSMEHPVKL LFAPNLLLDR NQGKCVEGMV 
601 EIFDMLLATS SRFRMMNLQp EEFVCLKSII LLNSGVYTFL SSTLKSLEEK 
65 1 DHIHRVLDKI TDTUHLMAK AGLTLQQQHQ. RLAQLLLILS HIRHMSNKGM 
701 EHLYSMKCKN WPLYDLLLE MLDAHRLHAP TSRGGASVEE TDQSHLATAG 
75 1 STSSHSLQKY YITGEAEGFP ATV 

- ESTROGEN BINDING DOMAIN ENDS HERE - 
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Fig. 4/1 

CODING SEOJJENCE OF THE FLP/EBD FUSION PROTEIN ENCODED BY 
PHFE1 

ATfiCCACAATTTGATATATTATGTAAAACACCACCTAAGGTGCTTGTTCGT 

TGCT^TG^CTAACCTATTTATGTTGGATGATTAC 

^TC^GAGAGCCACATrCATGAGCTATAATACTATCATAAGCAATT 

TCAGTTC^CGATATTGTCAACAAGTCACTGCAGTTTAAATACAAGACGCAA 

I^gSac^totgg^gcctcattaaagaa^ 

ATATTGTAAGTAGTrTGCAATTACAGTTCGAATCATCGGAAGAA^^^ 

^GGGAA^AGCCACAGTAAAAAAATGCTTAAAGCACTTCTAAGTGAGG 

GTGMA^TCTGGGAGATC^CTGAGAAAATACTAAATO 

ACTTOAGAmACAAAMCAAAMCTTTATACCAATO^ 

TACTTTCATCAATTGTGGAAGATTCAGCGATATTAAGAACGTTGATCCGA 

AATCATTTAAATTAGTCCAAAATAAGTATCTGGGAGTAATAATCCAGTG 

T^AGTGACAGAGACAAAGACAAGCGTTAGTAGGCACATATACrrCTTTA 

G^GCAAGGGGTAGGATCGATCC^CTTGTATATITGG^ 
ATTCTGAACCAGTCCTAAAACGAGTAAATAGGACCGGCAATTCTTCAAGC 
AAO^AG^GGAATACCAATTATTAAAAGATAACTTAGTCAGATCGTACA 
ACAAAGCTTTGAAGAAAAATGCGCCTTATTCAATCTTTGCTATAAAAAA 

TGGCCCAAAATCTCACATTGGAAGACATT^ 

GAAGGGCCTAACGGAGTTGACTAATGTTGTGGGAAATTGGAGCGATAA 

TGCTTCTGCCGTGGCCAGGACAACGTATACTCATCAGATAACAGCAATACCT 

GATCACTACTTCGCACTAGTTTCTCGGTACTATGCATATGATCCAATATCA 

^GGAAATGATy^CATTGAAGGATGAGACTAATCCAATTGAGGAGTGGC 

AGCATATAGAACAGCTAAAGGGTAGTGCTGAAGGAAGCATACGATACCCC 

GCATGGAATGGGATAATATCACAGGAGGTACTAGACTACCTTTCATCCTAC 

ATAAATAGACGCATA 
-FLP ENDS HERE- 
TCCGTACGCGGATCC 

- SYNTHETIC LINKER SEQUENCE ENDS HERE - 

ATGAAAGGTGGGATACGAAAAGACCGAAGAGGAGGGAGAATGTTGAAAC 

ACAAGCGCCAGAGAGATGATGGGGAGGGCAGGGGTGMGTGGGGTCTGCTG 

GAGACATGAGAGCTGCCAACCTTTGGCCAAGCCCGCTCATGATCAAACGCTC 

TAAGAAGAACAGCCTGGCCTTGTCCCTGACGGCCGACCAGATGGTCAGTGCCT 

TOTGGATGCTGAGCCCCCCATACTCTATTCCGAGTATGATCCTACCAGACCC 

TTCAGTGAAGCTTCGATGATGGGCTTACTGACCAACCTGGCAGACAGGGAGC 

TGGTTCACATGATCAACrGGGCGAAGAGGGTGCCAGGCTTTGTGGATTTGAC 
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Fig. 4/2 

CCTCCATGATCAGGTCCACCTTCTAGAATGTGCCTGGCTAGAGATCCTGATG 

A^GGTCTCGTCTGGCGCTCCATGGAGCACCCAGTGAAGCTACTGTTTGCTCCT 

AACTTGCTCTTGGACAGGAACCAGGGAAAATGTGTAGAGGGCATGGTGGAG 

ATCTTCGACATGCTGCTGGCTACATCATCTCGGTTCCGCATGATGAATCTGCA 

GGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGGAGTG 

TACACATTTCTGTCCAGCACCCTGAAGTCTCTGGAAGAGAAGGACCATATCC 

ACCGAGTCCTGGACAAGATCACAGACACTTTGATCCACCTGATGGCCAAGGC 

AGGCCTGACCCTGCAGCAGCAGCACCAGCGGCTGGCCCAGCTCCTCCTCATCCT 

CTCCCACATCAGGCACATGAGTAACAAAGGCATGGAGCATCTGTACAGCAT 

GAAGTGCAAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGAC 

GCCCACCGCCTACATGCGCCCACTAGCCGTGGAGGGGCATCCGTGGAGGAGAC 

GGACCAAAGCCACTTGGCCACTGCGGGCTCTACTTCATCGCATTCCTTGCAAA 

AGTATTACATCACGGGGGAGGCAGAGGGTTTCCCTGCCACAGTCTGA 

- HORMONE BINDING DOMAIN ENDS HERE - 
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